Evaluation Metric-related Optimization Methods for Mandarin Mispronunciation Detection
نویسندگان
چکیده
Mispronunciation detection and diagnosis are part and parcel of a computer assisted pronunciation training (CAPT) system, collectively facilitating second-language (L2) learners to pinpoint erroneous pronunciations in a given utterance so as to improve their spoken proficiency. This thesis presents a continuation of such a general line of research and the major contributions are three-fold. First, we compared the performance of different pronunciation features in mispronunciation detection. Second, we propose an effective training approach that estimates the deep neural network based acoustic models involved in the mispronunciation detection process by optimizing an objective directly linked to the ultimate evaluation metric. Third, we can linearly combine two F1-score when we consider F1-score as final objective function. It can effectively deal with the label imbalance problem. A series of experiments on a Mandarin mispronunciation detection task seem to show the performance merits of the proposed methods.
منابع مشابه
Mispronunciation Detection Leveraging Maximum Performance Criterion Training of Acoustic Models and Decision Functions
Mispronunciation detection is part and parcel of a computer assisted pronunciation training (CAPT) system, facilitating second-language (L2) learners to pinpoint erroneous pronunciations in a given utterance so as to improve their spoken proficiency. This paper presents a continuation of such a general line of research and the major contributions are twofold. First, we present an effective trai...
متن کاملContext Aware Mispronunciation Detection for Mandarin Pronunciation Training
Mispronunciation detection is an important component in a computer-assisted language learning (CALL) system. Many CALL systems only provide pronunciation correctness as the single feedback, which is not very informative for language learners. This paper proposes a context aware multilayer framework for Mandarin mispronunciation detection. The proposed framework incorporates the context informat...
متن کاملMispronunciation detection for Mandarin Chinese
In this paper, we propose several reliable weighting factors based on the speaker’s proficiency level, which can be used to normalize the scaled log-posterior probability (SLPP) to further improve mispronunciation detection at syllable level for Mandarin Chinese. Experiments based on a database consisting of 8000 syllables, pronounced by 40 speakers with varied pronunciation proficiency, shows ...
متن کاملAn Application of Modified Confusion Network for Improving Mispronunciation Detection in Computer- aided Mandarin Pronunciation Training
In this paper, we propose an application of confusion network for Mandarin mispronunciation detection. Compared to former published works, which are proven to work effectively and robustly in detecting mispronunciation in word level and only successfully detect mispronunciation in sentence level in strictly small constrained search space, our modified confusion network based Computer-aided Pron...
متن کاملA new method for mispronunciation detection using Support Vector Machine based on Pronunciation Space Models
This paper presents two new ideas for text dependent mispronunciation detection. Firstly, mispronunciation detection is formulated as a classification problem to integrate various predictive features. A Support Vector Machine (SVM) is used as the classifier and the loglikelihood ratios between all the acoustic models and the model corresponding to the given text are employed as features for the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- IJCLCLP
دوره 21 شماره
صفحات -
تاریخ انتشار 2016